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(57) Abstract 

The present invention provides open, reading frames ORFX, encoding isolated polypeptides, as well as polynucleotides encoding 
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Novel Polynucleotides And Polypeptides Encoded Thereby 



Background of the Invention 

The invention relates generally to nucleic acids and polypeptides encoded thereby, and 
methods of using these nucleic acids and polypeptides. 

Summary of the Invention 

The invention is based in part on the discovery of nucleic acids that include open reading 
frames encoding novel polypeptides, and on the polypeptides encoded thereby. The nucleic 
acids and polypeptides are collectively referred to herein as "ORFX". 

Accordingly, in one aspect, the invention provides an isolated nucleic acid molecule 
(SEQ ID NO:2h-1, wherein n is an integer between 1-3161), that encodes novel polypeptide, or a 
fragment, homolog, analog or derivative thereof. The nucleic acid can include, e.g., a nucleic 
acid sequence encoding a polypeptide at least 85% identical to a polypeptide comprising the 
amino acid sequences of SEQ ID N0:2h, wherein n is an integer between 1-3161. The nucleic 
acid can be, e.g., a genomic DNA fragment, or a cDNA molecule. 

Also included in the invention is a vector containing one or more of the nucleic acids 
described herein, and a cell containing the vectors or nucleic acids described herein. 

The invention is also directed to host cells transformed with a recombinant expression 
vector comprising any of the nucleic acid molecules described above. 

In another aspect, the invention includes a pharmaceutical composition that includes an 
ORFX nucleic acid and a pharmaceutical^ acceptable carrier or diluent. 
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In a further aspect, the invention includes a substantially purified ORF polypeptide, e.g., 
any of the ORFX polypeptides encoded by an ORFX nucleic acid, and fragments, homologs, 
analogs, and derivatives thereof. The invention also includes a pharmaceutical composition that 
includes a ORFX polypeptide and a pharmaceutical^ acceptable carrier or diluent. 
5 In a still a further aspect, the invention provides an antibody that binds specifically to an 

ORFX polypeptide- The antibody can be, e.g., a monoclonal or polyclonal antibody, and 
fragments, homologs, analogs, and derivatives thereof. The invention also includes a 
pharmaceutical composition including ORFX antibody and a pharmaceutically acceptable carrier 
or diluent The invention is also directed to isolated antibodies that bind to an epitope on a 
10 polypeptide encoded by any of the nucleic acid molecules described above. 

The invention also includes kits comprising any of the pharmaceutical compositions 
described above. 

The invention further provides a method for producing an ORFX polypeptide by 
providing a cell containing a ORFX nucleic acid, e.g., a vector that includes a ORFX nucleic 
15 acid, and culturing the cell under conditions sufficient to express the ORFX polypeptide encoded 
by the nucleic acid. The expressed ORFX polypeptide is then recovered from the cell. 
Preferably, the cell produces little or no endogenous ORFX polypeptide. The cell can be, e.g., a 
prokaryotic cell or eukaryotic cell. 

The invention is also directed to methods of identifying an ORFX polypeptide or nucleic 
20 acids in a sample by contacting the sample with a compound that specifically binds to the 
polypeptide or nucleic acid, and detecting complex formation, if present. 

The invention further provides methods of identifying a compound that modulates the 
activity of a ORFX polypeptide by contacting ORFX polypeptide with a compound and 
determining whether the ORFX polypeptide activity is modified. 
25 The invention is also directed to compounds that modulate ORFX polypeptide activity 

identified by contacting a ORFX polypeptide with the compound and determining whether the 
compound modifies activity of the ORFX polypeptide, binds to the ORFX polypeptide, or binds 
to a nucleic acid molecule encoding a ORFX polypeptide. 

In a another aspect, the invention provides a method of determining the presence of or 
30 predisposition of an ORFX-associated disorder in a subject. The method includes providing a 
sample from the subject and measuring the amount of ORFX polypeptide in the subject sample. 
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The amount of ORFX polypeptide in the subject sample is then compared to the amount of 
ORFX polypeptide in a control sample. An alteration in the amount of ORFX polypeptide in the 
subject protein sample relative to the amount of ORFX polypeptide in the control protein sample 
indicates the subject has a tissue proliferation-associated condition. A control sample is 
5 preferably taken from a matched individual, r.e., an individual of similar age, sex, or other 

general condition but who is not suspected of having a tissue proliferation-associated condition. 
Alternatively, the control sample may be taken from the subject at a time when the subject is not 
suspected of having a tissue proliferation-associated disorder. In some embodiments, the ORFX 
is detected using a ORFX antibody. 

10 In a further aspect, the invention provides a method of determining the presence of or 

predisposition of an ORFX-associated disorder in a subject. The method includes providing a 
nucleic acid sample, e.g., RNA or DNA, or both, from the subject and measuring the amount of 
the ORFX nucleic acid in the subject nucleic acid sample. The amount of ORFX nucleic acid 
sample in the subject nucleic acid is then compared to the amount of an ORFX nucleic acid in a 

1 5 control sample. An alteration in the amount of ORFX nucleic acid in the sample relative to the 
amount of ORFX in the control sample indicates the subject has a tissue proliferation-associated 
disorder. 

In a still fiirther aspect, the invention provides method of treating or preventing or 
delaying a ORFX-associated disorder. The method includes administering to a subject in which 

20 such treatment or prevention or delay is desired a ORFX nucleic acid, a ORFX polypeptide, or 
an ORFX antibody in an amount sufficient to treat, prevent, or delay a tissue proliferation- 
associated disorder in the subject. 

Unless otherwise defined, all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 

25 belongs. Although methods and materials similar or equivalent to those described herein can be 
used in the practice or testing of the present invention, suitable methods and materials are 
described below. All publications, patent applications, patents, and other references mentioned 
herein are incorporated by reference in their entirety. In the case of conflict, the present 
specification, including definitions, will control. In addition, the materials, methods, and 

30 examples are illustrative only and not intended to be limiting. 
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Other features and advantages of the invention will be apparent from the following 
detailed description and claims. 

Detailed Description Of The Invention 

The invention provides novel polypeptides and nucleotides encoded thereby. The 
5 polynucleotides and their encoded polypeptides can be grouped according to the functions played 
by their gene products. Such functions include, structural proteins, proteins from which 
associated with metabolic pathways fatty acid metabolism, glycolysis, intermediary metabolism, 
calcium metabolism, proteases, and amino acid metabolism, etc. 

Included in the invention are 3161 novel nucleic acid sequences and their encoded 
1 0 polypeptides. The sequences are collectively referred to as "ORFX nucleic acids" or ORFX 
polynucleotides" and the corresponding encoded polypeptide is referred to as a "ORFX 
polypeptide" or ORFX protein". For example, an ORFX nucleic acid according to the invention 
is a nucleic acid including an ORF1 nucleic acid, and an ORF polypeptide according to the 
invention is a polypeptide that includes the amino acid sequence of an ORF1 polypeptide. 
1 5 Unless indicated otherwise, "ORFX" is meant to refer to any of the ORF 1-3161 sequences 
disclosed herein. 

Table 1 provides a summary of the ORFX nucleic acids and their encoded polypeptides 
are summarized in Table 1. Nucleic acid sequences and polypeptide sequences for ORPX 
nucleic acids according to the invention is provided in the section of the specification entitled 
20 "Disclosed Sequences of ORFX Nucleic Acid and Polypeptide Sequences." 

Column 1 of Table 1, entitled "ORF #", denotes an ORF number assigned to a nucleic 
acid containing an open reading frame according to the invention. 

Column 2 of Table 1, entitled "Internal Identification number (Nucleic Acid Sequence 
Identification Number, Polypeptide Sequence Identification Number), provides an internal 
25 identification number for the indicated ORF, along with sequence identification numbers (SEQ 
ID NOs.) corresponding to the indicated ORF. In general, for an ORF/i according to the 
invention (wherein n is any integer from 1 to 3161), a nucleic acid corresponding to the ORF is 
SEQ ID NO:2>M, and an amino acid sequence encoded by the ORF is SEQ ID NO:2n. For 
example, a nucleic acid sequence corresponding to an ORF1 nucleic acid is SEQ ID NO:l, and a 
30 polypeptide sequence corresponding to an ORF I polypeptide is SEQ ID NO:2. Similarly, a 
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nucleic acid sequence corresponding to an ORF4 nucleic acid is SEQ ID NO:7, and a 
polypeptide sequence corresponding to an ORF4 polypeptide is SEQ ID NO:8; a nucleic acid 
sequence corresponding to an ORF198 nucleic acid sequence is SEQ ID NO:395, and a 
polypeptide sequence corresponding to an ORF198 polypeptide is SEQ ID NO:396. Nucleic acid 
5 sequences and polypeptide sequences for ORFX nucleic acids according to the invention are 

provided in the section of the specification entitled "Disclosed Sequences of ORFX Nucleic Acid 
and Polypeptide Sequences.'* 

Column 2 of Table 1, entitled "Protein Similarity", lists previously described proteins 
that are related to polypeptides encoded by the ORFs. Genbank identifiers for the previously 

10 described proteins are provided. These can be retrieved from http://www.ncbi.nlm.nih.gov/. 

To determine similarity to previously described proteins, polypeptides encoded by ORFX 
DNA sequences were tested using the Framesearch Algorithm against a nonredundant version of 
the GenPept Database from NCBI/Genbank. DNA sequences that had a score of W or above 
(Framesearch algorithm score, Edelman et. al. GCG Genetics) to a known protein were selected. 

15 Open reading frames were extended beyond the region of the protein matched using standard 
DNA translation and codon tables. Novel proteins that lacked a protein match were translated 
against the standard genetic codons and proteins with an ORF at least 80 amino acids and 
containing a Methionine start are included in the Table. 

Column 3 of Table 3, entitled "Protein Domains", lists previously described protein 

20 domains, designated by pfam entries, that are present in polypeptides encoded by the ORFs, 

Also included in column 3 are proteins in which these domains are present The pfam entries can 
be retrieved from http://pfam.wustl.edu/. DNA sequences were translated in all six frames and 
tested using the Hmmer Algorithm against the Pfam Database (References to the algorithm and 
Pfam database can be found at http://pfam.wustl.edu) . Translated DNA sequences that 

25 matched a protein domain entry in the Pfam database AND had a score of 7.5' were selected. 

Column 4 of Table 3, entitled "Protein Classification", lists the type of classification 
assigned for the protein, based on its homology. Examples of proteins in the classification 
include the following proteins: 
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phycoerythrin; an example of a luminescent material includes luminol; examples of 
bioluminescent materials include luciferase, luciferin, and aequorin, and examples of suitable 
radioactive material include l25 I, ,3, I, 35 S or 3 H. 

ORFX Recombinant Vectors and Host Cells 

5 Another aspect of the invention pertains to vectors, preferably expression vectors, 

containing a nucleic acid encoding ORFX protein, or derivatives, fragments, analogs or 
homologs thereof. As used herein, the term "vector" refers to a nucleic acid molecule capable of 
transporting another nucleic acid to which it has been linked. One type of vector is a "plasmid", 
which refers to a circular double stranded DNA loop into which additional DNA segments can 

10 be ligated. Another type of vector is a viral vector, wherein additional DNA segments can be 
ligated into the viral genome. Certain vectors are capable of autonomous replication in a host 
cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication 
and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon introduction into the host cell, and thereby are 

15 replicated along with the host genome. Moreover, certain vectors are capable of directing the 
expression of genes to which they are operatively linked. Such vectors are referred to herein as 
"expression vectors". In general, expression vectors of utility in recombinant DNA techniques 
are often in the form of plasmids. In the present specification, "plasmid" and "vector" can be 
used interchangeably as the plasmid is the most commonly used form of vector. However, the 

20 invention is intended to include such other forms of expression vectors, such as viral vectors 

(e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses), which serve 
equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of the 
invention in a form suitable for expression of the nucleic acid in a host cell, which means that the 

25 recombinant expression vectors include one or more regulatory sequences, selected on the basis 
of the host cells to be used for expression, that is operatively linked to the nucleic acid sequence 
to be expressed. Within a recombinant expression vector, "operably linked" is intended to mean 
that the nucleotide sequence of interest is linked to the regulatory sequence(s) in a manner that 
allows for expression of the nucleotide sequence (e.g., in an in vitro transcription/translation 

30 system or in a host cell when the vector is introduced into the host cell). The term "regulatory 
sequence" is intended to includes promoters, enhancers and other expression control elements 
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(e.g., polyadenylation signals). Such regulatory sequences are described, for example, in 
Goeddel; Gene Expression Technology: Methods in Enzymology 185, Academic Press, 
San Diego, Calif. (1990). Regulatory sequences include those that direct constitutive expression 
of a nucleotide sequence in many types of host cell and those that direct expression of the 
nucleotide sequence only in certain host cells (e.g., tissue-specific regulatory sequences). It will 
be appreciated by those skilled in the art that the design of the expression vector can depend on 
such factors as the choice of the host cell to be transformed, the level of expression of protein 
desired, etc. The expression vectors of the invention can be introduced into host cells to thereby 
produce proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g., ORFX proteins, mutant forms of ORFX, fusion proteins, etc.). 

The recombinant expression vectors of the invention can be designed for expression of 
ORFX in prokaryotic or eukaryotic cells. For example, ORFX can be expressed in bacterial cells 
such as £. coli, insect cells (using baculovixus expression vectors) yeast cells or mammalian 
cells. Suitable host cells are discussed further in Goeddel, Gene Expression Technology: 
Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990). Alternatively, the 
recombinant expression vector can be transcribed and translated in vitro, for example using T7 
promoter regulatory sequences and T7 polymerase. 

Expression of proteins in prokaiyotes is most often carried out in K coli with vectors 
containing constitutive or inducible promoters directing the expression of either fusion or 
non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded therein, 
usually to the amino terminus of the recombinant protein. Such fusion vectors typically serve 
three purposes: (I) to increase expression of recombinant protein; (2) to increase the solubility of 
the recombinant protein; and (3) to aid in the purification of the recombinant protein by acting as 
a ligand in affinity purification. Often, in fusion expression vectors, a proteolytic cleavage site is 
introduced at the junction of the fusion moiety and the recombinant protein to enable separation 
of the recombinant protein from the fusion moiety subsequent to purification of the fusion 
protein. Such enzymes, and their cognate recognition sequences, include Factor Xa, thrombin 
and enterokinase. Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; 
Smith and Johnson (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, Mass.) and 
pRIT5 (Pharmacia, Piscataway, N.J.) that fuse glutathione S-transferase (GST), maltose E 
binding protein, or protein A, respectively, to the target recombinant protein. 



39 



WO 00/58473 



PCTAJS00/08621 



Examples of suitable inducible non-fusion £. coli expression vectors include pTrc 
(Amrann et al, (1988) Gene 69:301-3 15) and pET 1 Id (Studier et al, GENE EXPRESSION 
Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1990) 
60-89). 

5 One strategy to maximize recombinant protein expression in E. coli is to express the 

protein in a host bacteria with an impaired capacity to proteolytically cleave the recombinant 
protein. See, Gottesman, Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, Calif. (1990) 1 19-128. Another strategy is to alter the nucleic acid 
sequence of the nucleic acid to be inserted into an expression vector so that the individual codons 

10 for each amino acid are those preferentially utilized in £. coli (Wada et al, (1992) Nucleic Acids 
Res, 20:2 111-2118). Such alteration of nucleic acid sequences of the invention can be carried 
out by standard DNA synthesis techniques. 

In another embodiment, the ORFX expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast S. cerivisae include pYepSecl (Baldari, et al, 

15 (1987) EMBO 76:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 30:933-943), pJRY88 
(Schultz et al, (1987) Gene 54:1 13-123), pYES2 (Invitrogen Corporation, San Diego, Calif.), 
and picZ (InVitrogen Corp, San Diego, Calif.). 

Alternatively, ORFX can be expressed in insect cells using baculovirus expression 
vectors. Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., 

20 SF9 cells) include the pAc series (Smith et al (1983) Mol Cell Biol 3:2156-2165) and the pVL 
series (Lucklow and Summers (1989) Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in mammalian 
cells using a mammalian expression vector. Examples of mammalian expression vectors include 
pCDM8 (Seed (1987) Nature 329:840) and pMT2PC (Kaufman et al. (1987) EMBO J 

25 6: 187-195). When used in mammalian cells, the expression vector's control functions are often 
provided by viral regulatory elements. For example, commonly used promoters are derived from 
polyoma, Adenovirus 2, cytomegalovirus and Simian Virus 40. For other suitable expression 
systems for both prokaryotic and eukaryotic cells. See, e.g., Chapters 16 and 17 of Sambrook et 
al, Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, 

30 Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N. Y., 1 989. 
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In another embodiment, the recombinant mammalian expression vector is capable of 
directing expression of the nucleic acid preferentially in a particular cell type (<?.g., 
tissue-specific regulatory elements are used to express the nucleic acid). Tissue-specific 
regulatory elements are known in the art. Non-limiting examples of suitable tissue-specific 

5 promoters include the albumin promoter (liver-specific; Pinkert et al. (1987) Genes Dev 

1:268-277), lymphoid-specific promoters (Calame and Eaton (1988) Adv Immunol 43:235-275), 
in particular promoters of T cell receptors (Winoto and Baltimore (1989) EMBO J 8:729-733) 
and immunoglobulins (Banerji et al. (1983) Cell 33:729-740; Queen and Baltimore (1983) Cell 
33:741-748), neuron-specific promoters {e.g., the neurofilament promoter, Byrne and Ruddle 

10 (1989) PNAS 86:5473-5477), pancreas-specific promoters (Edlund et al. (1985) Science 

230:912-916), and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Pat. No. 
4,873,316 and European Application Publication No. 264,166). Developmentally-regulated 
promoters are also encompassed, e.g., the murine hox promoters (Kessel and Gruss (1990) 
Science 249:374-379) and the a-fetoprotein promoter (Campes and Tilghman (1989) Genes Dev 

15 3:537-546). 

The invention further provides a recombinant expression vector comprising a DNA 
molecule of me invention cloned into the expression vector in an antisense orientation. That is, 
the DNA molecule is operatively linked to a regulatory sequence in a manner that allows for 
expression (by transcription of the DNA molecule) of an RNA molecule that is antisense to 

20 ORFX mRNA. Regulatory sequences operatively linked to a nucleic acid cloned in the antisense 
orientation can be chosen that direct the continuous expression of the antisense RNA molecule in 
a variety of ceil types, for instance viral promoters and/or enhancers, or regulatory sequences can 
be chosen that direct constitutive, tissue specific or cell type specific expression of antisense 
RNA. The antisense expression vector can be in the form of a recombinant plasmid, phagemid 

25 or attenuated virus in which antisense nucleic acids are produced under the control of a high 

efficiency regulatory region, the activity of which can be determined by the cell type into which 
the vector is introduced. For a discussion of the regulation of gene expression using antisense 
genes see Weintraub et al., "Antisense RNA as a molecular tool for genetic analysis," 
Reviews-Trends in Genetics, Vol. 1(1) 1986. 

30 Another aspect of the invention pertains to host cells into which a recombinant 

expression vector of the invention has been introduced. The terms "host cell" and "recombinant 
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host cell" are used interchangeably herein. It is understood that such terms refer not only to the 
particular subject cell but to the progeny or potential progeny of such a cell. Because certain 
modifications may occur in succeeding generations due to either mutation or environmental 
influences, such progeny may not, in fact, be identical to the parent cell, but are still included 
5 within the scope of the term as used herein. 

A host cell can be any prokaryotic or eukaryotic cell. For example, ORFX protein can be 
expressed in bacterial cells such as E coli, insect cells, yeast or mammalian cells (such as 
Chinese hamster ovary cells (CHO) or COS cells). Other suitable host cells are known to those 
skilled in the art. 

10 Vector DNA can be introduced into prokaryotic or eukaryotic cells via conventional 

transformation or transfection techniques. As used herein, the terms "transformation" and 
"transfection" are intended to refer to a variety of art-recognized techniques for introducing 
foreign nucleic acid (e.g., DNA) into a host cell, including calcium phosphate or calcium 
chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, or electroporation. 

15 Suitable methods for transforming or transfecting host cells can be found in Sambrook, et al. 
(Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor Laboratory, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory 
manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
20 expression vector and transfection technique used, only a small fraction of cells may integrate 
the foreign DNA into their genome. In order to identify and select these integrants, a gene that 
encodes a selectable marker (e.g., resistance to antibiotics) is generally introduced into the host 
cells along with the gene of interest. Various selectable markers include those that confer 
resistance to drugs, such as G418, hygromycin and methotrexate. Nucleic acid encoding a 
25 selectable marker can be introduced into a host cell on the same vector as that encoding ORFX 
or can be introduced on a separate vector. Cells stably transfected with the introduced nucleic 
acid can be identified by drug selection (e.g., cells that have incorporated the selectable marker 
gene will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in culture, can 
30 be used to produce (i.e., express) ORFX protein. Accordingly, the invention further provides 
methods for producing ORFX protein using the host cells of the invention. In one embodiment, 
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acid expression or activity. For example, mutations in a ORFX gene can be assayed in a 
biological sample. Such assays can be used for prognostic or predictive purpose to thereby 
prophylactically treat an individual prior to the onset of a disorder characterized by or associated 
with ORFX protein, nucleic acid expression or activity. 
5 Another aspect of the invention provides methods for determining ORFX protein, nucleic 

acid expression or ORFX activity in an individual to thereby select appropriate therapeutic or 
prophylactic agents for that individual (referred to herein as "pharmacogenomics"). 
Pharmacogenomics allows for the selection of agents {e.g., drugs) for therapeutic or prophylactic 
treatment of an individual based on the genotype of the individual (e.g., the genotype of the 
10 individual examined to determine the ability of the individual to respond to a particular agent.) 

Yet another aspect of the invention pertains to monitoring the influence of agents (e.g., 
drugs, compounds) on the expression or activity of ORFX in clinical trials. 

These and other agents are described in further detail in the following sections. 

Diagnostic Assays 

15 Other conditions in which proliferation of cells plays a role include tumors, restenosis, 

psoriasis, Dupuytren's contracture, diabetic complications, Kaposi's sarcoma and rheumatoid 
arthritis. 

An ORFX polypeptide may be used to identify an interacting polypeptide a sample or 
tissue. The method comprises contacting the sample or tissue with ORFX, allowing formation of 
20 a complex between the ORFX polypeptide and the interacting polypeptide, and detecting the 
complex, if present. 

The proteins of the invention may be used to stimulate production of antibodies 
specifically binding the proteins. Such antibodies may be used in immunodiagnostic procedures 
to detect the occurrence of the protein in a sample. The proteins of the invention may be used to 

25 stimulate cell growth and cell proliferation in conditions in which such growth would be 

favorable. An example would be to counteract toxic side effects of chemotherapeutic agents on, 
for example, hematopoiesis and platelet formation, linings of the gastrointestinal tract, and hair 
follicles. They may also be used to stimulate new cell growth in neurological disorders 
including, for example, Alzheimer's disease. Alternatively, antagonistic treatments may be 

30 administered in which an antibody specifically binding the ORFX -like proteins of the invention 
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would abrogate the specific growth-inducing effects of the proteins. Such antibodies may be 
useful, for example, in the treatment of proliferative disorders including various tumors and 
benign hyperplasias. 

Polynucleotides or oligonucleotides corresponding to any one portion of the ORFX 
5 nucleic acids of SEQ ID NO:2n-l (wherein n = 1 to 3161) may be used to detect DNA 

containing a corresponding ORF gene, or detect the expression of a corresponding ORFX gene, 
or ORFX-like gene. For example, an ORFX nucleic acid expressed in a particular cell or tissue, 
as noted in Table 2, can be used to identify the presence of that particular cell type. 

An exemplary method for detecting the presence or absence of ORFX in a biological 

10 sample involves obtaining a biological sample from a test subject and contacting the biological 
sample with a compound or an agent capable of detecting ORFX protein or nucleic acid (e.g., 
mRNA, genomic DNA) that encodes ORFX protein such that the presence of ORFX is detected 
in the biological sample. An agent for detecting ORFX mRNA or genomic DNA is a labeled 
nucleic acid probe capable of hybridizing to ORFX mRNA or genomic DNA. The nucleic acid 

15 probe can be, for example, a full-length ORFX nucleic acid, such as the nucleic acid of SEQ ID 
NO:2n-l (wherein n = 1 to 3161), or a portion thereof, such as an oligonucleotide of at least 15, 
30, 50, 100, 250 or 500 nucleotides in length and sufficient to specifically hybridize under 
stringent conditions to ORFX mRNA or genomic DNA, as described above. Other suitable 
probes for use in the diagnostic assays of the invention are described herein. 

20 An agent for detecting ORFX protein is an antibody capable of binding to ORFX protein, 

preferably an antibody with a detectable label. Antibodies can be polyclonal, or more preferably, 
monoclonal. An intact antibody, or a fragment thereof (e.g. , Fab or F(ab')2) can be used. The 
term "labeled", with regard to the probe or antibody, is intended to encompass direct labeling of 
the probe or antibody by coupling (i.e., physically linking) a detectable substance to the probe or 

25 antibody, as well as indirect labeling of the probe or antibody by reactivity with another reagent 
that is directly labeled. Examples of indirect labeling include detection of a primary antibody 
using a fluorescently labeled secondary antibody and end-labeling of a DNA probe with biotin 
such that it can be detected with fluorescently labeled streptavidin. The term "biological sample" 
is intended to include tissues, cells and biological fluids isolated from a subject, as well as 

30 tissues, cells and fluids present within a subject. That is, the detection method of the invention 
can be used to detect ORFX mRNA, protein, or genomic DNA in a biological sample in vitro as 
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well as in vivo. For example, in vitro techniques for detection of ORFX mRNA include Northern 
hybridizations and in situ hybridizations. In vitro techniques for detection of ORFX protein 
include enzyme linked immunosorbent assays (ELISAs), Western blots, immunoprecipitations 
and immunofluorescence. In vitro techniques for detection of ORFX genomic DNA include 
Southern hybridizations. Furthermore, in vivo techniques for detection of ORFX protein include 
introducing into a subject a labeled anti-ORFX antibody. For example, the antibody can be 
labeled with a radioactive marker whose presence and location in a subject can be detected by 
standard imaging techniques. 

In one embodiment, the biological sample contains protein molecules from the test 
subject Alternatively, the biological sample can contain mRNA molecules from the test subject 
or genomic DNA molecules from the test subject. A preferred biological sample is a peripheral 
blood leukocyte sample isolated by conventional means from a subject. 

In another embodiment, the methods further involve obtaining a control biological 
sample from a control subject, contacting the control sample with a compound or agent capable 
of detecting ORFX protein, mRNA, or genomic DNA, such that the presence of ORFX protein, 
mRNA or genomic DNA is detected in the biological sample, and comparing the presence of 
ORFX protein, mRNA or genomic DNA in the control sample with the presence of ORFX 
protein, mRNA or genomic DNA in the test sample. 

The invention also encompasses kits for detecting the presence of ORFX in a biological 
sample. For example, the kit can comprise: a labeled compound or agent capable of detecting 
ORFX protein or mRNA in a biological sample; means for determining the amount of ORFX in 
the sample; and means for comparing the amount of ORFX in the sample with a standard. The 
compound or agent can be packaged in a suitable container. The kit can further comprise 
instructions for using the kit to detect ORFX protein or nucleic acid. 

Prognostic Assays 

The diagnostic methods described herein can furthermore be utilized to identify subjects 
having or at risk of developing a disease or disorder associated with aberrant ORFX expression 
or activity. For example, the assays described herein, such as the preceding diagnostic assays or 
the following assays, can be utilized to identify a subject having or at risk of developing a 
disorder associated with ORFX protein, nucleic acid expression or activity in, e.g., proliferative 
or differentiative disorders such as hyperplasias, tumors, restenosis, psoriasis, Dupuytren's 
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contracture, diabetic complications, or rheumatoid arthritis, etc.; and glia-associated disorders 
such as cerebral lesions, diabetic neuropathies, cerebral edema, senile dementia, Alzheimer's 
disease, etc. Alternatively, the prognostic assays can be utilized to identify a subject having or at 
risk for developing a disease or disorder. Thus, the present invention provides a method for 

5 identifying a disease or disorder associated with aberrant ORFX expression or activity in which a 
test sample is obtained from a subject and ORFX protein or nucleic acid (e.g., mRNA, genomic 
DNA) is detected, wherein the presence of ORFX protein or nucleic acid is diagnostic for a 
subject having or at risk of developing a disease or disorder associated with aberrant ORFX 
expression or activity. As used herein, a "test sample" refers to a biological sample obtained 

1 0 from a subject of interest. For example, a test sample can be a biological fluid (e.g. , serum), cell 
sample, or tissue. 

Furthermore, the prognostic assays described herein can be used to determine whether a 
subject can be administered an agent (e.g., an agonist, antagonist, peptidomimetic, protein, 
peptide, nucleic acid, small molecule, or other drug candidate) to treat a disease or disorder 

15 associated with aberrant ORFX expression or activity. For example, such methods can be used 
to determine whether a subject can be effectively treated with an agent for a disorder, such as a 
proliferative disorder, differentiative disorder, glia-associated disorders, etc. Thus, the present 
invention provides methods for determining whether a subject can be effectively treated with an 
agent for a disorder associated with aberrant ORFX expression or activity in which a test sample 

20 is obtained and ORFX protein or nucleic acid is detected (e.g. , wherein the presence of ORFX 
protein or nucleic acid is diagnostic for a subject that can be administered the agent to treat a 
disorder associated with aberrant ORFX expression or activity.) 

The methods of the invention can also be used to detect genetic lesions in a ORFX gene, 
thereby determining if a subject with the lesioned gene is at risk for, or suffers from, a 

25 proliferative disorder, differentiative disorder, glia-associated disorder, etc. In various 

embodiments, the methods include detecting, in a sample of cells from the subject, the presence 
or absence of a genetic lesion characterized by at least one of an alteration affecting the integrity 
of a gene encoding a ORFX-protein, or the mis-expression of the ORFX gene. For example, 
such genetic lesions can be detected by ascertaining the existence of at least one of (1) a deletion 

30 of one or more nucleotides from a ORFX gene; (2) an addition of one or more nucleotides to a 
ORFX gene; (3) a substitution of one or more nucleotides of a ORFX gene, (4) a chromosomal 
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<210> 5611 

<211> 11S2 

<212> DNA 

<213> Homo sapiens 

<400> 5611 

ngggccgctc cctcccggac tcccggcctc ccggcctccc tggtcccgcc tgggaaggga 
60 

tgcaaggaag ccctccggcg ctgcgctccg aggcgggaga cagcgtcccc ctccgcccct 
120 
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cgggtcctgg cgcctcagag cccggcccag gccgcggaac ggtgatgctc gggccggacg 
180 

ggcgagcgcg gatccctgcg tcccgctgaa aatgtgtgtc tgacatgcaa gctcagtggg 
240 

gcagagaccc gtggattgct gtgccctgcc ctccggacct ggatcatgaa ggtgttggga 
300 

agaagcttct tctgggtgct gtttcccgtc cttccctggg cggtgcaggc tgtggagcac 
360 

gaggaggtgg cgcagcgtgt gatcaaactg caccgcgggc gaggggtggc tgccatgcag 
420 

agccggcagt gggtccggga cagctgcagg aagctctcag ggcttctccg ccagaagaat 
480 

gcagttctga acaaactgaa aactgcaatt ggagcagtgg agaaagacgt gggcctgtcg 
540 

gatgaagaga aactgtttca ggtgcacacg tttgaaattt tccagaaaga gctgaatgaa 
600 

agtgaaaatt ccgttttcca agctgtctac ggactgcaga gagccctgca gggggattac 
660 

aaagatgtcg tgaacatgaa ggagagcagc cggcagcgcc tggaggccct gagagaggct 
720 

gcaataaagg aagaaacaga atatatggaa cttctggcag cagaaaaaca tcaagttgaa 
780 

gcccttaaaa atatgcaaca tcaaaaccaa agtttatcca tgcttgacga gattcttgaa 
840 

gatgtaagaa aggcagcgga tcgtctggag gaagagatag aggaacatgc ttttgacgac 
900 

aataaatcag tcaagggggt caattttgag gcagttctga gggtggagga agaagaggcc 
960 

aattctaagc aaaatataac aaaacgagaa gtggaggatg acttggttct tagcatgctg 
1020 

attgactccc agaacaacca gtatattttg accaagccca gagattcaac catcccacgt 
1080 

gcagatcacc actttataaa ggacattgtt accataggaa tgctgtcttt gccttgtggc 
1140 

tggcgatgta ca 
1152 
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<212> PRT 

<213> Homo sapiens 

<400> 5612 

Met Lys Val Leu Gly Arg Ser Phe Phe Trp Val Leu Phe Pro Val Leu 

15 10 15 

Pro Trp Ala Val Gin Ala Val Glu His Glu Glu Val Ala Gin Arg Val 

20 25 30 

He Lys Leu His Arg Gly Arg Gly Val Ala Ala Met Gin Ser Arg Gin 

35 40 45 

Trp Val Arg Asp Ser Cys Arg Lys Leu Ser Gly Leu Leu Arg Gin Lys 

50 55 60 

Asn Ala Val Leu Asn Lys Leu Lys Thr Ala He Gly Ala Val Glu Lys 
65 70 75 80 

Asp Val Gly Leu Ser Asp Glu Glu Lys Leu Phe Gin Val His Thr Phe 
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